A Coreference Resolution Approach using Morphological Features in Arabic
نویسندگان
چکیده
Coreference resolution is considered one of the challenges in natural language processing. It is an important task that includes determining which pronouns are referring to which entities. Most of the earlier approaches for coreference resolution are rule-based or machine learning approaches. However, these types of approaches have many limitations especially with Arabic language. In this paper, a different approach to coreference resolution is presented. The approach uses morphological features and dependency trees instead. It has fivestages, which overcomes the limitations of using annotated datasets for learning or a set of rules. The approach was evaluatedusing our own customized annotated dataset and “AnATAr” dataset. The evaluation show encouraging results with average F1 score of 89%. Keywords—Coreference resolution; Anaphora; Alternative Approach; Arabic NLP; morphological features
منابع مشابه
Corefrence resolution with deep learning in the Persian Labnguage
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملMulti-Lingual Coreference Resolution With Syntactic Features
In this paper, we study the impact of a group of features extracted automatically from machine-generated parse trees on coreference resolution. One focus is on designing syntactic features using the binding theory as the guideline to improve pronoun resolution, although linguistic phenomenon such as apposition is also modeled. These features are applied to the Arabic, Chinese and English corefe...
متن کاملCoreference Resolution of Named Entities and Noun Phrases in Web Pages
An approach for intra-document coreference resolution of named entities and noun phrases is proposed. This approach is a knowledgepoor, integrated approach to coreference resolution which relies on syntactic, discourse and semantic information (using WordNet). Our approach is also intended to exploit the structural features of web pages for the purposes of discourse analysis. This research is i...
متن کاملMachine Learning for Mention Head Detection in Multilingual Coreference Resolution
This work introduces a machine learning approach to the identification of mention heads needed for multilingual coreference resolution (MCR). We evaluate the method and compare it to a heuristic baseline and a rule-based approach, which are widely used in coreference resolution systems. We use the CoNLL-2012 shared task data sets, which include data for Arabic, Chinese, and English. We show tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016